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ABSTRACT 



A method and system for distributing a service request 
among a plurality of servers is disclosed. A portion metric is 
assigned to each one of the plurality of servers, the portion 
metric designating a portion of total server requests to be 
allocated to the one of the plurality of servers. A server 
request may then be received. A total number of server 
requests processed by the plurality of servers is incremented 
and a number of server requests distributed to each one of 
the plurality of servers is maintained. The server request is 
then distributed to one of the plurality of servers using the 
portion metric assigned to each one of the plurality of 
servers, the number of server requests distributed to each 
one of the plurality of servers, and the total number of server 
requests. 

37 Claims, 6 Drawing Sheets 
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APPARATUS AND METHOD FOR cannot adequately maximize the throughput of a distributed 

PERFORMING TRAFFIC REDIRECTION IN system having servers with heterogeneous load capabilities. 

A DISTRIBUTED SYSTEM USING A It would be desirable if a metric were provided such that 

PORTION METRIC server requests could be distributed in accordance with the 

5 load capacity of each server. 

CROSS-REFERENCE TO RELATED Every host and router on the Internet has an IP address. 

APPLICATION The Domain Name System (DNS) is often used to map host 

This application is related to U.S. application Ser. No. names to these IP addresses. By way of example, a client 

09/175,673 Gled on the same day entitled "A System and typically sends a DNS query to a DNS server which includes 

Method for Determining a Preferred Service in a Network", 30 a host name and an indication that an IP address is requested, 

and U.S. application Ser. No. 09/175,516 filed on the same ^ ™S server then returns an IP address associated with 

day and entitled "A System and Method for Information me D0St Dame - l{ would therefore be desirable if a DNS 

Retrieval Regarding Services". server were designed to accomplish load distribution com- 
patible with the Internet and DNS. 

BACKGROUND OF THE INVENTION 15 l n addition to ensuring adequate load distribution, it is 

1 Field of the Invention necessary to accommodate for shifts of information. Infor- 

' ... „ «- i- - mation is commonly transferred from one web server to 

The present invention relates generally to traffic redirec- anQther web servef on the Inteme , Ty icall a web XIvel 

tion in a distributed system More particular^, the present ^ redirect a ^ tQ a new loc ^ Qn due tQ ^ fa ica , 

invention relates to a method and apparatus for redirecting 20 sbift of Momitioa between web However, the host 

traffic among a number of servers using a portion metnc for of ^ cliem fe typicaUy M taken imo am ^ en&)n during 

eac server. selection of the web server in providing this redirect. In 

2. Description of the Related Art many instances it would be desirable if the information 

A computer network may be defined as an interconnected could be selectively varied according to the particular client 

collection of autonomous computers. In a distributed 25 requesting the information. It would therefore be desirable if 

system, the existence of these multiple autonomous com- the host of the client were considered during selection of an 

puters is transparent to the user. To achieve this appropriate web server in order to provide such a redirect, 

transparency, allocation of jobs to processors and all other Moreover, it would be beneficial if the client could easily 

system functions must be automatic. These automated sys- identify the server to which the server request is redirected, 

tern functions are typically provided by an operating system. 30 i n v i ew G f tne above, a system and method for redirecting 

In general, the operating system hides the details of the traffic in a distributed system according to individual server 

hardware from the user and provides the user with a con- capabilities would be desirable. Additionally, it would be 

venient interface for using the system. More particularly, the beneficial if such a system distributed traffic in proportion to 

operating system is responsible for allocating resources 35 individual server portion metrics, 
within the distributed system and schedules the execution of 

various services accordingly. Thus, the operating system SUMMARY OF THE INVENTION 
selects the best processor, locates and transfers all corre- The present invention provides a method and apparatus 
sponding appropriate location. In this manner, the operating f or d istributing server requests among a pluralit y^ servers 
system ensures that system resources such as file servers are ^ m a distributed sy stem. This is accomplished through assign- 
used in an efficient manner. ing a portion metric to each server._ In^this.manner. the 
The resource allocation provided by the operating system port ion metric allows capabilities of e achxandidate-server to 
includes the retrieval and processing of data. Often, this data be taken into consideration during distribution of each server 
is stored on one or more shared file servers. Users in such a request. " " — 

system are called clients. Communication from a client 45 iTaccordance with one aspect of the present invention, 

generally comprises a request message asking for a partial- eacn sei^ern^uestisc^ the 

lar service to performed. The service request message is then Dom ajrNaWSvstem (\TNS) . Configuration is performed 

sent to an appropriate server. The server then does the work m several step s. First, a Domain Name System host name is 

requested and sends back a reply. Thus, data is accessed and associated with a plurality of servers, each one of the servers 

processed by the server in accordance with the service 50 havi ng a unique IP address. By way of example, a DNS table 

request message. on a rj^S server typically includes a plurality of entries, 

In order to send a service request message to a server, the each of the entries containing an IP address-host name 

operating system must first select an appropriate server. association. Second, a portion metric is then assigned to 

Typically, the operating system selects a server according to each one of the plurality of servers. The portion metric 

criteria that may be applied through the use of a metric. By 55 designates a portion of total server requests to be allocated 

way of example, a commonly used metric distributes the to the one of the plurality of servers. As a result, each weh 

service request message to a server closest in distance to the server is assigned a portio n metric. Each portion metric 

client. Accordingly, the operating system may direct traffic designates a portion of total server requests to be allocated 

to a server according to a specified metric. to the corresponding server. Once configuration is 

Although various metrics exist for allocating resources 60 completed, server requests may be allocated, 

within a network, these metrics do not adjust assignment of Eac h server request is separately allocated to a selec ted 

server requests in accordance with the load capacity of each web server. The server request may include a DNS h ost 

server. By way of example, each server may have different nam e query received from a clien t. By way of example, a 

processing capabilities. As yet another example, a server DlNS host name query may inclu de_a_host name to be looked 

may be entirely unavailable. Metrics traditionally used in 65 up and an indication that an IP address is re quested. A 

distributed systems do not adjust assignment of requests plurahly 6TTP addresses associated with the Jbostnam e may 

according to such situations. As a result, these metrics then be obtained, each one of the IP addresses being asso- 
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ciated with one of the plurality of servers. A total number of lion is accepted. A total number of server requests processed 

ser ver requests processed by the plurality of serve rs is by the plurality of servers is incremented. In addition, a 

mcremenied. In addition, a number of server requests dis- number of server req uests distributed to each of the se rvers 

tributed to each one of th e plurality of servers is main tained. is maintai ned. One of the plurality of servers is then selected 

One of_ the_pmr alitv of servers is selected using the p ortion 5 using the portio n metric assigned to each of the se rvers, the 

meu icassigned to each one of the plurality of servers , the nu mber of servexrequests distributed to each pTT Ee servers, 

number"!)! server requests distributed to each one of the and^ the_.tQtaX".numbcr of server request s. An HTTP cod e 

plurality of servers, and the total number of server requ ests. reojrect is then sent to the client. Therefore, information 

AnTF address associatea with the selecte d server is then pToWtedlo Tclient may be selectively varied acc ording to 

pr ovided to the client rrom which the server request w as 1Q ^ Da^nHIIaTHie ^tTe^uesting the informa tion. 

obta ined. Accordingly, wh en each serve r request is re ceived, _ — ; - . . - . - . , . nf 

rTT ; — i^T c in ij = - \ t — ■ rr-^ nr i The advantages of assigning a portion metric to a set of 

one of the pl uralit y of IP addresses is determinecFand . f. . * • * r^trrr^ , -,u 

— ff — , i ■ r r-nf ;— - „ servers and distribu ting a service request in a ccordance with 

provided to the client, in this manner, tramc sent to g eo- -r — ~r- t- : a — ^ nroconi 

r — T . — j . .« , — r 1 — such portion metric assignments are numerous. 1 tie present 

graphically distributed we o servers i n multiple netw orks . fr — «- , f ..... . nf * nnAi 

& — -ti — . . — ■: — -— — j . . * . invention may be used to allow capabilities or each candi- 

may be distributed and monitored. Moreover, the present 1C ~r y . . . , . , *\ . . . AtUr 

• ' — ■ r 3-" f — rr u' n a- . * 15 date server to be taken into consideration during distribution 

invention may be and is p referably g eographically distant _ , - - . ° . 

r it _ . J rp. — v ; l , a , u of each server request. By way of example, portion metric 

from the web servers. Thus, conn ections distributed to each . \ J. > • £ \i u ♦ r 

— r r -.fj — Ti — j r r- u assignments may be used to maximize the throughput ot a 

web server mav be monitored and load averages ol each we b . . . • »• 

r— ^ — r . — . , n distnbuted system having servers with diverse processor 

scngr miy belSpieq^ently queried . SimiUrly. portion metrics may be reassigned upon a 

In accordance with another aspect of the present 20 determination that a server is unavaflable. 
invention, the portion metric is used in combination with 

other metrics. A portion metric is assigned to each one o f a BRIEF DESCRIPTION OF THE DRAWINGS 
p lurality, of servers . The portion metric for each one of the 

plurality of servers is added to obtain a total portion m etric. FIG. 1 is a block diagram illustrating an exemplary 

A number of server reque^CTistg hiJ ^ tn parh nn fi of the 25 distributed system in which the portion metric of the present 

pluTality of servers is maintained. In addition, a total number invention may be implemented. 

of server requests processed by the plurality of servers is FIG. 2 is a block diagram illustrating a distributed system 

incremented. A server request is then received. A server in which the distributed director of the present invention is 

request percentage is computed for each one of the plurality implemented. 

of servers. The server request percentage for each one of the 30 pj G g ow diagram illustrating a method for oper- 

plurality of servers is a product of the number of server atmg tne distributed director in DNS mode according to one 

requests distributed to the one of the plurality of servers and embodiment of the present invention, 

the total portion metric divided by the total number of server nG 4 ^ a flow di illustrating a method for calcu- 

requests received. In addition a metric value is calculated ^ ^ { m for 0Qe of the luralit 

foreachoneofthepuralityofservers^emetovaluefor 35 of ^ {Q ^ embodimeQt of lhe t 

one of the plurality of servers is defined by the server request invention 

percentage for the one of the plurality of servers and the '. 

portion metric for the one of the plurality of servers. The F ? G - s * a flow dl f8 ra f m ^"ting a method for deter- 

metric values for the plurality of servers are compared to mulln S lne melnc value for ° ne ° f the P lurahl y of servers 

obtain a selected server. « a<*°r din 8 to ° ne embodiment of the present invention. 

The selected server may include a set of servers. The set FIG. 6 is a flow diagram illustrating a method for oper- 
of servers may include those servers having an identical ating the distributed director in HTTP redirect mode accord- 
metric value. Alternatively, a tolerance range may be con- m 8 t0 one embodiment of the invention, 
figured for the portion metric. The tolerance range may then DETAILED DESCRIPTION OF THE 
be used in comparing the metric value for each of the 45 PREFERRED EMBODIMENTS 
servers. Thus, a set of the plurality of servers having a metnc 

value within the tolerance range may be obtained. An The present invention provides a method and system for 

alternate metric may then be applied to the set of servers to redirectin g traffic am on g a number of serv ers. This is accom- 

obtain a single selected server. By way of example, the plished through ascertaining the hosJL for the clien t, deter- 

alternate metric may include a distance metric. The selected 50 m jgjng a plurality nf servers associated with the fiost T and 

server is then provided to the client. sel ecting one of the plurality of serve rs. The selection may 

In accordance with yet another aspect of the invention, be based upon defined portion metrics as well as other 

load imbalance may be detected. As described above, a set metrics. Various embodiments of the invention are possible, 

of the plurality of servers is obtained upon application of the By way of example, the present invention may be imple- 

portion metric. When the set of the plurality of servers 55 mented in accord ance with th e Domain_Nam e System as 

includes only one server, this may indicate the presence of well as the HTTP protocol. I n additio n, the invention can be 

load imbalance. By way of example, if there is a large implement ed in numerousjwa ys, including as a method, an 

disparity in the amount of server requests distributed to each apparatus such as a switching element (e.g., router or 

server, there will be one server that has received the least brouter), or a computer readable medium. Several embodi- 

amount of server requests, and this server will therefore be 60 ments of the invention are discussed below, 

selected. A report may be generated indicating the presence Referring first to FIG. 1, a block diagram illustrating an 

of this load imbalance in the distributed system. Moreover, exemplary distributed system in which the portion metric of 

the detection of load imbalance permits reallocation of the present invention may be implemented is presented. A 

servers as well as the reconfiguration of portion metrics and distributed system 10 will typically include one or more 

tolerance ranges. 65 clients 12 and a plurality of servers. As illustrated, the 

In accordance with a further aspect of the invention, a plurality of servers includes a first server 14, a second server 

portion metric is assigned to each server. An HTTP connec- 16, and a third server 18. Typically, a client 12 will send a 
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request for a service which will be distributed to one of the with a plurality of IP addresses corresponding to a plurality 

plurality of servers. To distribute each such service request, of web servers in the distributed system, assigning a portion 

one or more metrics may be used i n the selection process. In metric to each server, assigning other metrics to each server, 

a commoDlyjised metric, the service request is distr ibuted to arJ d specifying a tolerance range for application of each 

the server closest in d istancejo the client. The first server 14 5 specified metric. 

is a first distance 2 (f from the clienfl^Ttne second server 16 A M . DnDTinKT kxctdim-vi •„ „ ■ aA 

. — -. — . — 5 •• — ^ , , . ... A portion metnc PORTION METRIQXI is assigned to 

is a second distance 22 trom the c hentlZ. and the third , r ~. . - t_ v 1 * .u . * 1 

< «- p -.- . . , ,. ^w_" ■ ■ "7" — * - A . each one of the plurality of servers, where X=l to the total 

server 18 is a third distance 24 from the client 12. As shown, . r ^ M c u . • a • 

^ — — ^ ^. ' number of servers N. Each portion metric designates a 

the second distance 22 indicates that the second server 16 is . f .*i , . i_ n . j . Jl 

. . « . . A , • portion of total server requests to be allocated to the corre - 

closest in distance to the client 12. Accordingly, the service 10 ^ , . ? rtf v~ i„ 

... j 1* - c u sponding one 01 the plurality of servers. For example, 

request would be distributed to the second server 16 if such r . 0 . . £n « . f 4 

,. j .I • * . • portion metrics 20%, 32%, and 60% may be assigned to 

a metric were applied. However, in circumstances where f, . . . ™_ 4 . . . , * 

i_ (4L 1 1-* c ' u 1C . • j . three servers, respectively. The portion metrics do not nec- 

each one of the plurality of servers 14, 16, 18 is equidistant j . ♦ . 1 1 r ujjt-u *r 

r • . ■ ^ J , ' , M u . essanly need to total a value of one hundred. Thus, if 

from the client 12, another metric may be used as a tie * . . , t , 

, . „ . . ■ • . , ' ... « percentages have been overassigned as m the previous 

breaker. Moreover, the distance between the client and r 1 *u *u . c • j * 

. t , . , . . . 15 example, then the percentage of server requests assigned to 

seated server may be deemed ins.gmficant m oompram each ^ wm £ tomati * all be ^ down £ each 

to factors such as the processor speed of each available ^ metric ^ ^ ^ % of f > 

server. In such instances, an alternate metric may be desir- . , i . , ~ j r 7 

a ^j e J and retain their relative configured portion of requests. 

3 Referring next to FIG. 2, a block diagram of a distributed 20 According to one embodiment a portion metric is 

system, in which the distributed director of the present t0 each ^ m ^ d ^ U ^^^^ m |^ e 

invention according to one embodiment is presented. A ™S ! host name^ For example, "HOST SERVER! POR- 

distributed system 70 in which the present invention may be TON 32 f co ^ d ^? 6 i0 ™i™*t* ^\^ n ° f ^ 

implemented is illustrated. According to the present requests for the DNS host SERVER1 is 32. These portion 

invention, a distributed director 72 may be provided for each 25 metncs ma y theD te us f t0 determine a percentage of server 

host. As shown, the distributed director 72 is provided for an re£ * uests t0 s f nd t0 ea , ch / e ^ er Morc ° ver ' a default P or r Uo f 

Internet host 74. As will be described, a client 76 connecting meLnc mav * P ro ^ded. By way of example, the default 

to the Internet 74 may wish to connect to a web server portmn metric may be zero. As a result, servers with a higher 

located in another geographical location. By way of P ortlOD m u etnc wlU ' recei u ve » lar S er DUmber of 

example, the San Jose web server 78 and Paris web server 30 r ^ quests than s f^rs with a lower portion metric This ) 

80 are shown. The distributed director 72 determines a host aUows a netw u ork ^mmistrator to adjust server load distn- 

associated with the client 76 (e.g., patents.com). The dis- butl0n across heterogeneous distributed servers, resulting in 

tributed director 72 may be configured to be the primary ™P™ved performance as seen by clients. 

DNS server for a particular host. Thus, once the host isi The portion metric may be used in combination with othe r 

determined, the I P addresses associated witrghis, ho st may 35 metrics nr alnne. tn distribute server requests in a network or 
e asily be obtained. In this manner, available web — ( distributed system to select an annmnriate. server for each 

associated with these IP addresses are made available for ^client Similarly, any of these metrics may be turned off or 

selection. o n to add or remove metrics wfiicfr ^rfct o be conside red. For 

In response to a server request, the distributed director 72 example, a command "PORTION X METRIC2 . . . " may be 

utilizes one or more metrics to select one of these web 40 P rovided to add the portion metric and a second metric, 

servers. Metrics utilized by the distributed director 72 may "METRIC2". Similarly, a command "NO METRIC3" could 

include a variety of metrics, including the distance metric. be used t0 tum off a mird metric > "METRICS". When the 

An information gathering block 82 may be utilized to gather P ortlon metric ™ used m combination with other metrics, the 

metric information such as the distance to each server. The order in which these metrics are considered is specified. By 

inf ormation gathering block 82 mav include a routeniin d 45 w ay of example, the first metric specified. mjhe_cpmmand 

t herefore must have access to all routing, tables relatintU o line may be considered first, followed by, each following 

th T geographically distributed Web servers. _Bv way of metric specified. 

1 exa mple, the distribu teo Ldirector -22L. mav query the Jn fbr- In addition to specifying a portion metric as well as other 

I mation gathering block 82 f or^distance metrics between t he metrics for each server, a tole rance rangej nay be configured, 

j distr ibuted ^servers 78. 80 andlfe -cli*"' 7 fj, Acco rding to one 50 The tolerance range supplies a range used to determine, 

embodiment, the information gathering block 82 may relative to a lowest or highest value, whether any distributed 

include a Direct Response Protocol (DRP) agent, available servers should be equally preferred for a given client. For 

from Cisco Technology, Inc., located in San Jose, Calif. In example, assume there are three servers: SERVER1, 

addition, the distributed director 72 provides a portion SERVER2 and SERVER3 associated with values 100, 119 

metric which allows capabilities of each candidate server to 55 and 125, respectively. If the tolerance value is set to 20 

be taken into c onsideratio n during dis tribution of eac h server percent, SERVER1 (associated with value 100) and 

isgussl. The^stributed director 72 ma^Jherefore direct SERVER2 (associated with value 119) would be equally 

clients to an appropriate server that '^ ^^blosest in dis- preferred since 119 is within the 20 percent tolerance range. 

t ance.J n this manner, the distributed director 72 provides SERVER3 (associated with value 125) would be eliminated 

dynamic, transparent, and scalable Internet traffic load dis- 60 fr° m consideration. 

tribution between multiple geographically dispersed servers. The present invention may be config^redtcjonctjon in a 

Referring next to FIG. 3, a flow diagram illustrating a QNS nap de as we \ \ as, ^ l ^P"re3irect_mode . Additional 

method for operating the distributed director in DNS mode DNS resource records must be added to the primary 

a ccording to on e^emhQclir nent of the_inve ntion is shown. As domain's" primary-DNS-scrv er jo ,identify-the-distributed 

shown, a senacere guestJs dis tributed among a plurality of 65 di rector as the authoritative name server for.a_gi vep host in 

serveril267 first, alstep^28^i the distributed director is DNS^jnode ^r as the actual Werj^ ejxe xrequested by^th e end 

config ured. ConfigTirat ion.may^nclujeassociaTing each host use^in^Fi^ ^P-redirect mod e. Moreover, DNS resource 
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records used by the distributed director may be configured in metrics may be assigned a weight during configuration 

an external s graexjQt^a lterna livel y, i n the distributed direc- indi cating a ma ric_j)riority. If multiple metrics have the 

| tqr__-^ s ame metric p riority, the metrics may be add ed to obta in a 

Next, at step 29, initialization of variables is performed. co mposite me tric. For example, if t wo metrics hav e the same 

During this step, a variable storing a total number of server 5 met ric value , the metri c values may each be multi plied by 

requests processed by the plurality of servers, TOTAL__ their respective weight values (if specified) and thenadded 

REQUESTS, is initialized to a constant. In addition, a together to form the composite metric for the corresponaTng 

variable for each one of the plurality of servers storing a server. According to one embodiment, the default weight 

number of server requests distributed to each one of the values are set to 1. 

plurality of servers, NUMBER_SERVER„REQUESTS[X] 10 The metric value for each one of the plurality of servers 

are each initialized to a constant. According to a preferred are then com pare dJo_choQse_Qje_ar more "selected " server s 

embodiment, the variables are initialized to zero. However, (e.g., a server having a lowest or highest metric valued at ste p 

another constant may be used to avoid the potential problem 40. In the embojimen t^de^DTj^lE&S^ Jl Js_ assumed t hat 

of dividing by zero. the server nayjn^ _the. ,l o>vest„metric. value_J s selected. 

Once the distributed director is configured and initializa- 1S More over, a se t of the plurality of servers having metric 

tion is performed, a server request may then be received at values within the tolerance range may be "selected", 

step 30. By way of example, the server request may include I f multiple server s are determined to have been "selected" 

a DNS host name query that is received from a client. A DNS at step 42 (the metric values associated with the selected" 

host name query typically includes a host name to be looked servers are within the tolerance range), an alternate metric 

up and an indication that an IP address is requested. 20 m ay be used as a "rfe breakefS at step 44. By wav of 

After the server request is received, the portion metrics example, the server closest _in_distance to the client may be 

may be utilized as well as other metrics to select an selected. As yet another example, the next metric ma y be 

appropriate server in response to each server request. usgjLUxselect the "jes t" server. 

According to one embodiment, each server request is dis- 25 Next, at step^46, the IP address associate d with the 

tributed in accordance with the Domain Name System selected -server Is^rovided to the client "from which the 

(DNS). At step 32, a plurality of IP addresses associated with server request was obtained. By way of example, the IP 

the queried host name are obtained- Each one of the plurality address may be returned to the client's local DNS. 

of IP addresses are associated with one of a plurality of Therefore, the server request is distributed using the portion 

servers in the distributed system. In this manner, a set of 3Q metric assigned to each one of the plurality of servers, the 

servers are obtained from w hich one is to be selec ted. A t step number of server requests distr ibuted to each one of the 

34, the portion metrics for^acfTone "ofTEe" plurality of plurality of serv ers, and'theTotal nu^mJier.Qf .seryerrre^ uests 

servers are add ed to obtain a total p ortion m etric, TOTAL_ as ^escribed*abov e. As ^r^uh^t^disuibuted director 

PORTION. efiectfvdy hin^tk^^s*T"DNS server" In tins manner, a 

In order to distribute the server request to an appropriate 35 ^^^^^^^y^^^J^sL^&S^^ with the 

server, metric valuesjQr_each„Qne_of the plurality of servers DNS. 

ar e_compute d. A meldc-vate- for a selected server X is The tolerance range specifiegLfpx^ach^port ultimately 

computed using the number of server requests distributed to determmes^the^freguency that Jhe^portion "metric is used to 

the selected server, the portion metric for the selected server, altocate^eryer requests. By way of example, the first metric 

the total portion metric, and the total number of server 40 to be applied may be the portion metric and the second 

requests processed by the distributed system where meiric,tQ_he_applied^m ay be the jdist a^^nietrjc. If the 

METRIC_VALUE[X] = SERVER_REQUEST_ tolerance range for the portion metric is set to 20 percent, all 

PERCENTAGE[X]-PORTION_METRIC[X]. At step 36, a servers having a metric value within 20 percent of the 

server request percentage is calculated for each one _oi the optimum metric value (e.g., lowest or highest metric value) 

plurality-UxLservers. For each server, the server request 45 will be considered to be equally desirable. In this case, the 

percentage js caj cj ilated ysing the number of server requests second distance metric is applied as a "tie breaker". 

distr^ut^jtO-the^sejver, ^ m"e~toFal"portion metric, and the Accordingly, the second distance metric will be used in all 

total number of server requests received where SERVER_ instances where tw^rjiQie.seryers are within the tolerance 

REQUEST_PERCENTAGE[X]=NUMBER_SERVER_ range of an optimum metric value. As a result, the portion 

REQUESTS[X]*TOTAL_PORTION/TOTAL__ 50 metric is used only in instances where only one server falls 

REQUESTS. Thus, the server request percentage is a within this tol erance range . In other words, the portion 

percentage of totalrequests distributed to a server X which metric wiUTHTimately be used only to correct load imbal- 

-is normalized to the portion^ etne^ignments. Next at ances according to the configured tolerance range, while a 

step 3*C a metrirvalue is delerminedTb? "eacli one of the second metric (e.g., distance metric) will be applied when 

plurality of server s. For each se rver, the metric value is 55 there is no load imbalance . 

determined using the server req uest perc entage for the As described above, the frequency that the portion metric 

server, the portion metric for the server, and thTTfotal number is applied may provide an indication of load imbalance, 

of server requests received. According to one embodiment, Monitoring the application of the portion metric will there - 

the metric value is obtained by subtracting the portion metric fore indicate whether the load apportionment among the 

from the server request percentage, METRIC_VALUE[X]= 60 servers is appropriate. Accordingly, a counter may be incre- 

SERVER_REQUEST_PERCENTAGE[X]-PORTION_ mented each time the portion metric is applied. This counter 

METRICfX]. However, a constant such as TOTAL_ value may then be used to correct any load imbalance 

REQUESTS may be added to this value to ensure that the present in the distributed system. 

METRIC_VALUE [X] is a positive number. since the application of the portion metric (e.g., one 

If multiple metrics have been specified for each server, the 65 server ^is^ sei ected ^ac cord ing Ao the^j flrtiojL. metric') will 

metrics mav_ be combined Jp form a single metric value prior indicate the preser^e^oXaJo^ad^imbalance within the distrib- 

to server selection. In addition, each metric or selected uted system, a report may be generated each time the portion 
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metric is applied. The report may include information plurality of ser\'ersS3^_shown in FIG. 2 according to one 

designed to assist in correcting load imbalance. By way of embodiment of the invention is illustrated^ At step 62, a first 

example, Ihenrvport may include the number of server one of the plurality of servers is selected. As described 

requests distributed to each server. Thus, the report will above, a server may be selected according to an index, X. At 

fiirnish-Thene ^Sflry information to al low a system admin- 5 step 64, the portion metric associated with the selected one 

istralo1nTrr^mT%iir'e s elected por timTme tries and tolerance 0 f the plurality of servers, PORTION_METRIC[X], is 

ranges^as well as modify ihe ^TTonarinn^f the servers By subtracted from the server request percentage for the 

way of example, the geographical distribution of the servers selected one of the plurality of servers, SERVER„ 

may be modified. As yet another example, additional severs REQUEST__PERCENTAGE[X], to obtain a metric value 

may be added to a particular seographicaUocation. 3Q for the selected one of the plurality of servers, METRIC. 

Once the sery^rrequest has been distributed to a server, VALUE[X]. Thus, METRIC_JVALUE[X]=SERVER_ 

aU v ariables aie^pg ted |to reflect the receipt of a new s erver REQUEST_PERCENTAGE[X]-PORTION„METRIC 

mS&SSBSB^&flte^not tne new serveneqitest to [X ]). In additioQ> ^ described above, a constant such as 

the^Ue4^£iy£t. At step «, me number of server requests TOTAL__REQUESTS may be added to this value to ensure 

distributed to the selected server, NUMBER_SERVER_ M-cium \/at Ticrvi ,v « 

REQUESTS[SELECTED SERVER] is increased. In this 15 the METRIC_VAWE[X] is a positwe numter Nex^t 

iL L , £ — J 4 ^-r-ir-r-T3- • step 66, it is determined whether the metnc value for each 

manner, the number of server requests distributed to each r fL i i ■* r L i i * j ^ i 

one of the plurality of servers is maintained. According to ?/ * c P^al^y of servers has been calculated. For example 

one embodiment, the number of server requests distributed * the mdex > X ; 15 e ^ al to _ the number ? f ' dl 

to the selected server is increased in response to step 46. necessary calculations have been performed. However, if the 

Similarly, at step 50, the total number of server requests 20 mdex, X, is les s than the number of servers, the metric value 

processed by the plurality of servers, TOTAL_REQUESTS, musjjbe^cj^ remaining servers. At step 66, if 

is incremented. the^metgi c_ vatoey has not been calculated for each of the 

The above steps may be modified or performed in an servers, the index, X, ^incremented at step 68, and the 

alternate order without departing from the scope of the method loops at step 64 until the metric value for each server 

invention. For example, according to a presently preferred 25 nas ^ een calculated. 

embodiment, the total number of server requests processed According to another embodiment of the invention, the " 

by the plurality of seryej^is.inciemented in response to step d istributed d irector operates in HTTP redirect mo de in 

^Accordingly, the total number of serverjequests is accorda nce witFThe H l u fT r protocol. Referrin g now to FIG, 

increm ented by one . Alternatively, the total number of server 6, a method tor operating the distribute d director in HTTP 

requests may be computed as a sum of the number of s erver 30 redirect mode 84 is presented . The d^girjuTeTBu-ectorls 

requests distributed to each one of the pluralltjTof servers. con^guje^at.stepJ86. Configuration may include associating 

As an otherexa mple, the total numbe xpf serverr equests may each host witrTa plurality o f IP addresses c orresponding to 

be incrementecfiji^esponse to step 29. This woulcl^elimmate a pniraluYo Tw eb serv ej ^nTffie^ ^ 

the possibihty of a tf divide~r5y zero" error since the total mg^portion^metric to^ach server, assigning^omer metrics 

number of server requests would be incremented prior to 35 to each server, and s^ecifiyin^ appli- 

step 36. cation of each specified metric as described above in step 28. 

Referring now to FIG. 4, a flow diagram illustrating a Next, at step 88, an HTTP connection is accepted from a 

method for calculating the server request percentage for client. Thus, the distributed director operates_as,though it 

each of the plurality of servers 36 shown in FIG. 2 according were a web_server. T he distributed director has an IP address 

to an embodiment of the present invention is shown. At step 40 as well as "a different I P address use d by a web r edirector. As 

52, a first one of the .plurahty„of-serversJs_sel6Cted > _For a result, the client may communicate wiifi the web red irector 

example, if the server request percentage for each of the by cpnnertjj|gJtQ^lie-4veb^ 

plurality of servers is stored in an array, each one of the redirector may then redirect c onnections thro ugh the use of 
plurality of servers mav^be^selecte d according to an in jgx- virtual Aveb servers. By way of example, a different host (and 
X. As shown, the first index in such an array may be one. At 45 IP address) i s associated ^with-each set of virtual servers. The 
step 54, the number of serv er requ ests di stributed to the client is then connected to one of a set of virtual we b serve rs 
selected one of the plurality ot servers and~tbe total portion asftocifit flrl w il h th e riijtnKilted director al step M The 
metric are multiplied to obtain a product. Next, at step 56, rii'ent J ^lP_addrp^ thenoblained at step 90 sinc e it is later 
the product is divided by the total number of server requests used to send anHJTP redi rect. A host name (e.g., 
received to obtain the server request percentage for the 50 www.cisco.com) associated with the IP address of the virtual 
selected one of the plurality of servers. According to a web server connected to is then dete rriiined-at.step-92.,This 
preferred embodiment, in order to avoid dividing by zero, it may be pe rformed v ia a DNS server T as described ab ove, 
is determined if the total number of server requests is equal k Once the client is connected to t he virtual web serv er, it 
to zero. If the total number of server requests is equal to zero, is desjrahle^to obtain a sej of IP add ressee from which to 
the server request percentage for the one of the plurality of 55 sele ct an IP adoVess^aM sen d anHIT P code, redire ct. One 
servers is initialized toWmJSfext, at step 58, it is determined p roblem which m ay occur when iTset of IP addresses 
whether the server request percentage for each of the plu- associated with the host name of the virtual server is 
rality of servers has been calculated. For example, if the obtained is that the IE_address of the virtuaJLa telLserver 
index, X, is equal to the number of servers N, all necessary cjMUie<nedJojmj^ Since it would be 
calculations have been performed. However, if the index, X, 60 extremely undesirable to select the v irtual web serv er that 
is less than the number of servers, the server request the client is connected to, a new host name different from the 
percentage must be calculated for the remaining servers. At host name~oTthe virtual server connected to may be con- 
step 58, if the server request percentage has not been structed. At step 93, a new host name associated with the 
calculated for each of the servers, the index, X, is incre- host name determined in step 92 is constructed. By way of 
mented at step 60, and the method continues at step 54. 55 example, a string may be appended to the host name of the 
Referring now to FK^j^, a diagram illustrating a virtual server that the client is connected to. A set of IP 
method for determining J^he^jnetrjc^value^for-each of the addresses of real web servers associated with the new host 
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name are then obtained at step 94. By way of example, the 
real web servers may be associated with the new host name 
in a DNS jerve r. In this manner, the IP address of the virtual 
web server co nnected to may be excludedroT filtered; from 
selection. 

Once the set of IP addresses are obtained, an IP address 
may be selected. As shown in FIG. 6, one of the set of IP 
addresses may be selected according to steps 32 through 44 
described above. At that time, the client's IP address may be 
used in the selection process depending on the metric(s) 
applied. By^wayjofejcample^ke^o^ use the 

client's IP address to ascertain the distance between a server 
and the client. The distributed, director th en sends to the 
client the HlTF status code "30 2 temporarily moved" with 
a URL for the selected server associated with the selected IP 
v address at step 96. The client may then transparently connect 15 
Wo the selected web server specified by this JURL. System 
"variables may then be updated to reflect the server sel ection 
according to steps 48 through 50 described above-.5n thm 
mannejjyjjejn^^ with the server to whicnX 

ithas been ^ redirected trans parently through thr. rlislribulpd 120 
dug£|PX-W£k _server] Accordingly, information provided toir 
client noay^be selectivel y varied acco rding to the particulars 

client requesting the information. J 

fJ The present invention redirects clients to an appropriate 
server according to one or more- metrics. This eliminate s the 25 
n eed for end users to choose a server from a list of possible 
sit es. In^^ ^U^n^^ince Jhe distributed dtirectorlocauzeT^ 
tra ffic lo^ntemetser^^ 
\ accjy&4aexfafflaa^^ costs. '* 

The invention can also KTemBolEu^^ 
code on a computer readable medium. The computer read- 
able medium is any data storage device that can store data 
which can thereafter be read by a computer system. 
Examples of the computer readable medium include read- 
only memory, random-access memory, CD-ROMs, mag- 
netic tape, and optical data storage devices. The computer 
readable medium can also be distributed over a distributed 
system between coupled computer systems so that the 
computer readable code is stored and executed in a distrib- 
uted fashion. 

Although illustrative embodiments and applications of 40 
this invention are shown and described herein, many varia- 
tions and modifications are possible which remain within the 
concept, scope, and spirit of the invention, and these varia- 
tions would become clear to those of ordinary skill in the art 
after perusal of this application. For instance, the present 45 
inventio n is described as bemgjmplementedand capab le of 
o perating^in-DNS mQde. or HlTF red^ ect mode. However, 
ifshould be understood that the invention is not limited to 
such implementations, but instead would equally apply 
regardless of the context and system in which it is imple- 50 
mented. Thus, broadly speaking, the operations described 
above may be used to distribute traffic in a distributed 
system based on predefined server metrics. In addition, the 
above described invention may be stored on a disk drive, a 
hard drive, a floppy disk, a server computer, or a remotely 55 
networked computer. Accordingly, the present embodiments 
are to be considered as illustrative and not restrictive, and 
the invention is not to be limited to the details given herein, 
but may be modified within the scope and equivalents of the 
appended claims. 

What is claimed is: 

1. A method for distributing a service request, the method 
comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 
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accepting an HTTP connection; 

incrementing a total number of server requests processed 
by the plurality of servers; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

selecting one of the plurality of servers using the portion 
metric assigned to each one of the plurality of servers, 
the number of server requests distributed to each one of 
the plurality of servers, and the total number of server 
requests; and 

sending an HTTP code redirect. 

2. A method for distributing a service request, the method 
comprising: 

determining a metric value associated with each one of a 
plurality of servers; 

configuring a tolerance range used in comparing the 
metric value for each one of the plurality of servers; 

comparing the metric value for each one of the plurality 
of servers to obtain a set of the plurality of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

applying an alternate metric to the set of the plurality of 
servers to obtain a selected server when the set of the 
plurality of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurality of servers; and 

providing the selected server. 

3. A method for distributing a service request, the method 
comprising: 

associating each one of a plurality of servers with a 
Domain Name System host name, each one of the 
plurality of servers having a unique IP address; 

assigning a portion metric to each one of the plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 

receiving a server request, the server request being a 
Domain Name System host name query including the 
Domain Name System host name; 

incrementing a total number of server requests processed 
by the plurality of servers; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

selecting one of the plurality of servers using the portion 
metric assigned to each one of the plurality of servers, 
the number of server requests distributed to each one of 
the plurality of servers, and the total number of server 
requests; and 

providing an IP address associated with the selected one 
of the plurality of servers. 

4. The method as recited in claim 3, wherein selecting one 
of the plurality of servers further includes: 

computing a metric value for each one of the plurality of 
servers using the number of server requests distributed 
to each one of the plurality of servers, the portion 
metric assigned to each one of the plurality of servers, 
and the total number of server requests processed; and 

comparing the metric value for each one of the plurality 
of servers to determine a selected server. 

5. The method as recited in claim 4, wherein maintaining 
the number of server requests distributed to each one of the 
plurality of servers further includes: 

increasing the number of server requests distributed for 
the selected server. 
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6. The method as recited in claim 4, wherein the selected 
server has a lowest metric value. 

7. The method as recited in claim 4, wherein the selected 
server has a highest metric value. 

8. The method as recited in claim 5, wherein increasing 5 
the number of server requests distributed is performed in 
response to distributing the server request. 

9. The method as recited in claim 3, wherein incrementing 
the total number of server requests further includes: 

incrementing the total number of server requests in 10 
response to receiving the server request. 

10. The method as recited in claim 3, wherein increment- 
ing the total number of server requests further includes: 

incrementing the total number of server requests in 

response to distributing the server request. 15 
U. The method as recited in claim 3, further including: 
if more than one server is selected, applying an alternate 
metric to a set of the plurality of servers to obtain a 
selected server. 2Q 

12. The method as recited in claim 4, further including: 
configuring a tolerance range used in comparing the 

metric value for each one of the plurality of servers. 

13. The method as recited in claim 12, wherein comparing 
the metric value for each one of the plurality of servers 2 s 
further includes: 

defining a set of the plurality of servers having metric 
values within the tolerance range and including the 
selected one of the plurality of servers; and 

applying an alternate metric to the set of the plurality of 30 
servers to obtain a selected one of the plurality of 
servers when the set of the plurality of servers includes 
more than one server, and otherwise establishing that 
the selected one of the plurality of servers is the set of 
the plurality of servers. 35 

14. The method as recited in claim 12, wherein comparing 
the metric value for each one of the plurality of servers 
further includes: 

obtaining a set of the plurality of servers, each one of the 
set of the plurality of servers having a metric value 40 
within the tolerance range; and 

applying an alternate metric to the set of the plurality of 
servers to obtain a selected one of the plurality of 
servers when the set of the plurality of servers includes 
more than one server, and otherwise establishing that 45 
the selected one of the plurality of servers is the set of 
the plurality of servers. 

15. The method as recited in claim 4, wherein computing 
the metric value for each one of the plurality of servers 
further includes: 50 

adding the portion metric for each one of the plurality of 
servers to obtain a total portion metric; 

calculating a server request percentage for each one of the 
plurality of servers using the number of server requests 55 
distributed to each one of the plurality of servers, the 
total portion metric, and the total number of server 
requests received; and 

determining a metric value for each one of the plurality of 
servers using the server request percentage distributed go 
to each one of the plurality of servers and the portion 
metric assigned to each one of the plurality of servers. 

16. The method as recited in claim 15, wherein calculating 
the server request percentage for each one of the plurality of 
servers further includes: 65 

calculating a server request percentage for one of the 
plurality of servers using the number of server requests 
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for the one of the plurality of servers, the total portion 
metric, and the total number of server requests 
received. 

17. The method as recited in claim 16, wherein calculating 
the server request percentage for the one of the plurality of 
servers includes: 

multiplying the number of server requests distributed to 

the one of the plurality of servers and the total portion 

metric to obtain a product; and 
dividing the product by the total number of server 

requests received to obtain a server request percentage 

for the one of the plurality of servers. 

18. The method as recited in claim 15, wherein determin- 
ing the metric value for each one of the plurality of servers 
further includes: 

determining a metric value for one of the plurality of 
servers using the server request percentage for the one 
of the plurality of servers and the portion metric for the 
one of the plurality of servers. 

19. The method as recited in claim 18, wherein determin- 
ing the metric value for the one of the plurality of servers 
further includes: 

subtracting the portion metric for the one of the plurality 
of servers from the server request percentage for the 
one of the plurality of servers to obtain a metric value 
for the one of the plurality of servers. 

20. The method as recited in claim 18, wherein determin- 
ing the metric value for the one of the plurality of servers 
further includes: 

subtracting the server request percentage for the one of 
the plurality of servers from the portion metric for one 
of the plurality of servers to obtain a metric value for 
the one of the plurality of servers. 

21. The method as recited in claim 17, wherein dividing 
the product by the total number of server requests further 
includes: 

if the total number of server requests is equal to zero, 
initializing the server request percentage for the one of 
the plurality of servers to a value of zero; and 

if the total number of server requests is greater than zero, 
dividing the product by the total number of server 
requests received to obtain the server request percent- 
age for the one of the plurality of servers. 

22. The method as recited in claim 15, wherein calculating 
the server request percentage for each one of the plurality of 
servers further includes: 

assigning a value of zero to the server request percentage 
for each one of the plurality of servers when the total 
number of server requests is equal to zero. 

23. The method as recited in claim 3, further including: 
initializing the total number of server requests and the 

number of server requests distributed to each one of the 
plurality of servers to a constant. 

24. The method as recited in claim 23, wherein the 
constant is zero. 

25. The method as recited in claim 3, wherein increment- 
ing the total number of server requests further includes: 

adding the number of server requests distributed to each 
one of the plurality of servers to obtain the total number 
of server requests. 

26. A method for distributing a service request, the 
method comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 
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adding the portion metric for each one of the plurality of 
servers to obtain a total portion metric; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

receiving a server request; 5 

incrementing a total number of server requests processed 
by the plurality of servers; 

computing a server request percentage for each one of the 
plurality of servers, the server request percentage for 
one of the plurality of servers being a product of the 
number of server requests distributed to the one of the 
plurality of servers and the total portion metric divided 
by the total number of server requests received; 

calculating a metric value for each one of the plurality of 15 
servers, the metric value for one of the plurality of 
servers being defined by the server request percentage 
for the one of the plurality of servers and the portion 
metric for the one of the plurality of servers; 

comparing the metric value for each one of the plurality 20 
of servers to obtain a selected server; 

applying an alternate metric to the selected server when 
the selected server includes more than one server; and 

providing the selected server. 

27. The method as recited in claim 26, wherein the 25 
alternate metric is a distance metric. 

28. The method as recited in claim 26, further including: 
turning a plurality of metrics on, the plurality of metrics 

including the alternate metric. 

29. The method as recited in claim 26, further including: 30 
specifying an order in which each of the plurality of 

metrics is considered. 

30. The method as recited in claim 26, further including: 
specifying a priority for each one of the plurality of 35 

metrics. 

31. The method as recited in claim 30, wherein specifying 
a priority further includes: 

assigning a weight indicating a metric priority to one of 
the plurality of metrics. 40 

32. A method for distributing a service request, the 
method comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 45 
of servers; 

adding the portion metric for each one of the plurality of 

servers to obtain a total portion metric; 
maintaining a number of server requests distributed to 

each one of the plurality of servers; 50 
receiving a server request; 

incrementing a total number of server requests processed 
by the plurality of servers; 

computing a server request percentage for each one of the 55 
plurality of servers, the server request percentage for 
one of the plurality of servers being a product of the 
number of server requests distributed to the one of the 
plurality of servers and the total portion metric divided 
by the total number of server requests received; 60 

calculating a metric value for each one of the plurality of 
servers, the metric value for one of the plurality of 
servers being defined by the server request percentage 
for the one of the plurality of servers and the portion 
metric for the one of the plurality of servers; 65 

configuring a tolerance range used in comparing the 
metric value for each one of the plurality of servers; 
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comparing the metric value for each one of the plurality 
of servers to obtain a set of the plurality of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

applying an alternate metric to the set of the plurality of 
servers to obtain a selected server when the set of the 
plurality of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurality of servers; and 

providing the selected server. 

33. A method for detecting load imbalance within a 
distributed system, the method comprising: 

assigning a portion metric to each one of a plurality of 
servers, the portion metric designating a portion of total 
server requests to be allocated to the one of the plurality 
of servers; 

adding the portion metric for each one of the plurality of 
servers to obtain a total portion metric; 

maintaining a number of server requests distributed to 
each one of the plurality of servers; 

receiving a server request; 

incrementing a total number of server requests processed 
by the plurality of servers; 

computing a server request percentage for each one of the 
plurality of servers, the server request percentage for 
one of the plurality of servers being a product of the 
number of server requests distributed to the one of the 
plurality of servers and the total portion metric divided 
by the total number of server requests received; 

calculating a metric value for each one of the plurality of 
servers, the metric value for one of the plurality of 
servers being defined by the server request percentage 
for the one of the plurality of servers and the portion 
metric for the one of the plurality of servers; 

configuring a tolerance range used in comparing the 
metric value for each one of the plurality of servers; 

comparing the metric value for each one of the plurality 
of servers to obtain a set of the plurality of servers, each 
one of the set of the plurality of servers having a metric 
value within the tolerance range; 

detecting load imbalance when the set of the plurality of 
servers includes only one server; and 

applying an alternate metric to the set of the plurality of 
servers to obtain a selected server when the set of the 
plurality of servers includes more than one server, and 
otherwise establishing that the selected server is the 
obtained set of the plurality of servers. 

34. The method as recited in claim 33, further including: 
generating a report in response to the detection of load 

imbalance, the report including information designed to 
assist in correcting load imbalance. 

35. The method as recited in claim 34, wherein the report 
includes the number of server requests distributed to each 
one of the plurality of servers. 

36. The method as recited in claim 33, further including: 
reassigning portion metrics for selected ones of the plu- 
rality of servers in response to the detection of load 
imbalance. 

37. The method as recited in claim 33, further including: 
reconfiguring the tolerance range in response to the detec- 
tion of load imbalance. 
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